Peirce’s i and Cohen’s κ for 2 × 2 Measures of Rater Reliability

Authors

  • Beau Abar
  • Eric Loken
  • Junbin B. Gao
Abstract

This study examined a historical mixture model approach to the evaluation of ratings made in “gold standard” and two-rater 2 × 2 contingency tables. Peirce’s i and the derived i average were discussed in relation to a widely used index of reliability in the behavioral sciences, Cohen’s κ. Sample size, population base rate of occurrence, the true “science of the method”, and guessing rates were manipulated across simulations. In “gold standard” situations, Peirce’s i tended to recover the true reliability of ratings as well as or better than κ. In two-rater situations, i average recovered the true reliability as well as or better than κ in most cases. The empirical utility and potential theoretical benefits of mixture model methods in estimating reliability are discussed, as are the associations between the i statistics and other modern mixture model approaches.
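As a rough illustration of the two indices compared in the abstract, the sketch below computes Cohen’s κ and Peirce’s i from the four cells of a 2 × 2 table. It uses the standard textbook definitions (κ as chance-corrected agreement, i as hit rate minus false-alarm rate) and is not taken from the paper. The cell names `a`, `b`, `c`, `d`, the function names, and the `simulate_table` helper are hypothetical. The simulation assumes a simple mixture reading of the abstract: with probability `s` (the “science of the method”) the rater reports the true state, and otherwise guesses “positive” with probability `g`. The paper’s actual simulation design may differ.

```python
import random


def cohen_kappa(a, b, c, d):
    """Cohen's kappa for the 2 x 2 table [[a, b], [c, d]].

    Rows are the rater's calls (positive, negative); columns are the
    second rater or gold standard (positive, negative).
    """
    n = a + b + c + d
    p_obs = (a + d) / n  # observed agreement
    # agreement expected by chance from the marginal totals
    p_exp = ((a + b) * (a + c) + (c + d) * (b + d)) / n ** 2
    return (p_obs - p_exp) / (1 - p_exp)


def peirce_i(a, b, c, d):
    """Peirce's i with columns as the gold standard.

    a = hits, b = false alarms, c = misses, d = correct rejections,
    so i = hit rate - false-alarm rate.
    """
    return a / (a + c) - b / (b + d)


def simulate_table(n, base_rate, s, g, seed=0):
    """Hypothetical mixture simulation (an assumption, not the paper's code).

    Each case is truly positive with probability base_rate. The rater
    reports the truth with probability s, else guesses positive with
    probability g. Returns the cell counts (a, b, c, d).
    """
    rng = random.Random(seed)
    a = b = c = d = 0
    for _ in range(n):
        truth = rng.random() < base_rate
        rating = truth if rng.random() < s else (rng.random() < g)
        if rating and truth:
            a += 1
        elif rating:
            b += 1
        elif truth:
            c += 1
        else:
            d += 1
    return a, b, c, d


if __name__ == "__main__":
    table = simulate_table(n=100_000, base_rate=0.2, s=0.7, g=0.5)
    print("kappa:", round(cohen_kappa(*table), 3))
    print("i:    ", round(peirce_i(*table), 3))
```

Under this assumed mixture, the expected hit rate is s + (1 − s)g and the expected false-alarm rate is (1 − s)g, so i estimates s directly regardless of the base rate, whereas κ also depends on the marginal totals. For the fixed table (18, 8, 2, 72), for instance, i is 0.8 while κ is about 0.72.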

Similar resources

Assessing reperfusion with whole-brain arterial spin labeling: a noninvasive alternative to gadolinium.

BACKGROUND AND PURPOSE Arterial spin labeling (ASL) is a perfusion imaging technique that does not require gadolinium. The study aimed to assess the reliability of ASL for evaluating reperfusion in acute ischemic stroke in comparison with dynamic susceptibility contrast (DSC) imaging. METHODS The study included 24 patients with acute ischemic stroke on admission and 24-hour follow-up ASL and ...

Use of hand-held Doppler ultrasound examination by podiatrists: a reliability study

BACKGROUND Hand-held Doppler examination is a frequently used non-invasive vascular assessment utilised by podiatrists. Despite this, the reliability of hand-held Doppler has not been thoroughly investigated. Given the importance of Doppler in completing a vascular assessment of the lower limb, it is essential to determine the reliability of the interpretation of this testing method in practici...

Inter-rater reliability of AMSTAR is dependent on the pair of reviewers

BACKGROUND Inter-rater reliability (IRR) is mainly assessed based on only two reviewers of unknown expertise. The aim of this paper is to examine differences in the IRR of the Assessment of Multiple Systematic Reviews (AMSTAR) and R(evised)-AMSTAR depending on the pair of reviewers. METHODS Five reviewers independently applied AMSTAR and R-AMSTAR to 16 systematic reviews (eight Cochrane revie...

One-two-triage: validation and reliability of a novel triage system for low-resource settings

OBJECTIVES To validate and assess reliability of a novel triage system, one-two-triage (OTT), that can be applied by inexperienced providers in low-resource settings. METHODS This study was a two-phase prospective, comparative study conducted at three hospitals. Phase I assessed criterion validity of OTT on all patients arriving at an American university hospital by comparing agreement among ...

Bladder Prolapse Configuration on Baseline Standing Cystogram Can Predict Anterior Vaginal Wall Suspension Procedure Outcomes.

OBJECTIVE To evaluate whether bladder prolapse shape on lateral voiding cystourethrogram (VCUG) is an accurate predictor of anterior vaginal wall suspension (AVWS) procedure outcomes. METHODS Following an institutional review board approval, preoperative lateral standing VCUG views from a prospectively maintained database of women who underwent AVWS for stage ≥2 cystocele were reviewed retros...

Publication date: 2010